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•  -     ABSTRACT 

The  HF  channel  is  characterized  with  regard  to  frequenoy 
selective  fading  and  atmospheric  noise  under  restrictive 
mathematical  assumptions,  and  compared  with  measured 
statistics.   Actual  channel  characteristics  are  utilized  to 
obtain  parameters  for  choosing  appropriate  convolutional 
codes.   Short  threshold  decodable  convolutional  codes  with 
only  a  moderate  amount  of  interleaving  are  shown  to  be 
capable  of  affecting  a  reduction  in  the  output  error 
probability  by  at  least  an  order  of  magnitude,  for  uncoded 
error  probabilities  on  the  order  of  2  x  10 
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I.   INTRODUCTION 

Radio  communication  in  the  frequency  range  from  2-30  MHz 
has  been  common  for  many  years.   However,  the  reliability  of 
such  a  channel  is  not  high  enough  for  modern  high  speed 
computer  to  computer  communications.   In  order  to  increase 
the  reliability  of  this  form  of  communications,  the  nature 
of  the  channel  must  be  understood,  or  at  least  modeled  in 
a  fashion  that  is  a  reasonable  approximation  to  the  situa- 
tion.  Once  this  is  accomplished,  various  solutions  may  be 
attempted  to  obtain  optimal  reliability  via  the  channel. 

To  describe  the  situation  in  general  terms,  the  trans- 
mitter sends  a  signal  through  the  channel  to  a  receiver. 
However,  due  to  various  causes,  the  signal  received  may  not 
be  the  one  sent,  in  which  case  an  error  is  said  to  have 
occurred.   These  errors  result  from  two  basically  different 
causes:   1)  signal  distortion,  and  2)  noise.   Noise  is 
considered  throughout  this  report  to  be  completely  unrelated 
to  the  signal,  while  signal  distortion  is  considered  as  a 
perturbation  of  the  signal  and  thus  is  related  to  the 
signal.   As  an  example,  if  a  portion  of  the  signal  energy 
is  delayed  so  that  it  arrives  slightly  later  than  the  rest 
of  the  signal,  interference  will  result.   If  the  interference 
is  destructive,  the  total  received  signal  will  be  less  and 
the  signal  is  said  to  fade. 

The  first  section  is  devoted  to  a  detailed  discussion 
of  the  signal  and  one  common  form  of  noise  associated  with 
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the  channel.   The  second  section  discusses  convolutional 
codes  in  general  and  applies  them  to  the  channel  as  charac- 
terized in  the  first  section.   It  is  shown  that  it  is 
possible  to  obtain  several  orders  of  magnitude  improvement 
in  the  probability  of  not  making  an  error. 
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II.   THE  SIGNAL  ENVIRONMENT 

A.   INTRODUCTION 

The  HF  channel  is  difficult  to  analyze  in  general  due 
to  the  number  and  complexity  of  the  various  parameters. 
The  channel  is  time  varying  over  a  period  of  hours,  days, 
months,  or  years.   (The  eleven  year  sun  spot  cycle  greatly 
effects  parameters  such  as  Maximum  Usable  Frequency  (MUF) , 
Lowest  Useful  Frequency  (LUF)  and  absorption.)   In  general 
the  HF  communication  channel  is  not  normally  line  of  sight, 
this  implies  that  there  is  some  form  of  'bending'  involved 
with  the  path  of  propagation  of  the  electromagnetic  radia- 
tion.  This  'bending'  results  from  two  principal  sources: 
1)  refraction  caused  by  a  gradient  in  the  dielectric 
constant  of  the  medium,  and  2)  reflection  caused  by  an 
abrupt  discontinuity  in  the  dielectric  constant.   Refraction 
normally  accounts  for  a  considerable  amount  of  the  signal 
energy  received  and,  is  commonly  referred  to  'as  the  specular 
component . 

In  the  way  of  a  basic  review,  the  upper  atmosphere 
becomes  ionized  in  various  regions  from  ultraviolet  and  cor- 
puscular radiation  from  the  sun  [1] .   Incident  electro- 
magnetic radiation  is  refracted  by  these  levels  and  if  the 
operating  or  carrier  frequency  is  properly  chosen  the 
radiated  signal  will  return  to  earth  at  the  receiver.   The 
amount  of  energy  received  is  dependent  on  many  factors 
ranging  from  antenna  polarization  to  angle  of  launch  or 
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reception  and  the  exact  state  of  the  ionisphere.   Frequently 
multiple  paths  will  be  possible  and  the  resulting  signals 
at  the  receiver  may  add  either  constructively  or  destruc- 
tively.  In  addition  to  the  refraction,  some  of  the  incident 
radiation  may  be  scattered  from  irregularities  in  the  ionis- 
phere and  this  will  also  be  received.   Figures  2.1  and  2.2 
illustrate  the  basic  phenomena.   Hence  the  received  signal 
is  the  sum  of  the  scatter  and  spectral  signals.   The 
probability  characteristics  of  this  type  of  signal  have  been 
well  treated  by  Nesenbergs  [2]  and  are  included  here  only 
for  a  review. 


F  Region 


Transmitter 

Figure  2.1.   Source  of  specular  components. 

Paths  which  result  in  the  reception  of  two 
specular  components  at  the  receiver.   The 
refraction  of  the  incident  electromagnetic 
energy  is  a  result  of  the  gradient  of  ion 
density  as  a  function  of  height. 
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Irregularities 
in  Ionisphere 


Transmitter 


Figure  2.2.   Source  of  the  Scatter  Com- 
ponents.  The  scatter  components 
result  from  the  reflection  from 
many  relatively  abrupt  discontinuities 
in  the  ionosphere.   The  discontinuities 
may  be  due  to  numerous  causes  such  as 
meteor  trails  or  turbulance. 


Receiver 


For  a  given  path  length,  a  relatively  straight  forward 
computational  procedure  may  be  followed  to  give  the  maximum 
usable  frequency,  and  the  relative  amplitude  of  other 
specular  components  which  may  be  presented  with  the  fraction 
of  time  these  components  may  be  expected.   Hence,  for  a  given 
operating  frequency,  a  rough  approximation  may  be  made  with 
regard  to  the  number  of  specular  components.   This,  we  shall 
see,  will  affect  the  bit  error  probability.   Table  I  is  a 
set  of  sample  predictions  (from  Ref.  2)  indicating  the 
specular  components  probable  together  with  the  percentage 
of  times  they  were  expected  to  be  present. 

It  is  apparent  that  this  system  is  quite  complicated, 
so  let  us  begin  by  making  some  restrictive  assumptions  which 
will  facilitate  the  mathematics. 
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We  assume  that  the  HF  channel  under  consideration  is 
stationary.   While  this  is  not  strictly  true,  over  the 
period  of  several  minutes  it  is  a  very  reasonable  approxi- 
mation.  A  second  helpful  assumption  is  that  all  the 
components  (scatter  and  specular)  are  mutually  statistically 
independent.   This  assumption  will  permit  us  to  use  char- 
acteristic functions  and  simplify  our  task.   (The  number  of 
modes  possible  may  be  determined  by  the  relationship  of  the 
carrier  frequency  with  the  maximum  usable  frequency,  however, 
if  the  modes  exist  their  effects  may  be  statistically 
independent . ) 

It  will  also  be  assumed  in  this  section  that  the  environ- 
mental noise  is  gaussian.   In  a  later  section  we  will  con- 
sider in  some  detail  a  recent  model  for  atmospheric  noise 
and  indicate  the  modifications  which  result. 

B.   SIGNAL  AMPLITUDE  CHARACTERISTICS 

We  begin  by  considering  the  received  signal  as  two 
dimensional,  where  the  two  dimensions  will  be  the  inphase 
and  quadrature  components  of  the  signal.   We  will  have 
occasion  to  consider  two  basic  types  of  signals  which  may 
be  present  at  the  same  time,  hence  we  define  a  signal  to 
be  specular  if  it  is  of  fixed  amplitude,  although  the  phase 
may  still  be  random.   A  second  type  of  signal,  called 
scatter ,  is  characterized  by  a  symmetric  bivariate  gaussian 
distribution  which  for  mathematical  convenience  has  equal 
inphase  and  quadrature  variance.   We  define  the  probability 
distribution  function  (DF)  of  the  random  variable  X  as 


l  n 


F(x)  =  Prob  (X  <  x)  _oo<x<oo 

with  the  corresponding  probability  density  function  (pdf) 

f(x)  =  mzi . 

dx 

It  will  be  convenient  to  define,  in  the  standard  fashion, 
the  characteristic  function 


CU)  =  E[ejCx] 


OO      .  j. 

=  /   eJ^x  dP(x) 


OO      .  _ 

=  /   eJ^x  f(x)dx  . 

_oo 


where  E[«]  denotes  the  expected  value. 

For  our  purpose,  we  consider  the  two  dimensional  case  where 

F(x,y)  =  Prob(X  <  x,  Y  <  y) 

with  the  corresponding  joint  probability  density  function 
f(x,y)  and  characteristic  function 

OO     OO    .  /  -  s 

C(£,n)  =  /   /  ejUX  ny;f(x,y)dxdy  .  (1) 


_oo  _oo 


Let  us  consider  a  typical  specular  component, 

♦l 

V.  =  X.  +  jY.  =  |V. I  eJ 

1     1    °    l    '  i ' 

where  X.  is  the  cosine  component,  Y.  the  sine  component, 
|V.  |  is  the  envelope  and  <J)  the  phase  of  V.  .   Let  us  assume 
that  the  phase  is  uniformly  distributed  over  (0,2tt),  and 
independent  of  |V. | .   Since  we  are  dealing  with  a  specular 
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component,  which,  by  definition,  has  a  constant  amplitude, 
we  have 


Vj  |vA„y,   -  vy    y    A±  -r  Jr±  -  x-±  j, 


f  h7.  |  (x,y)  =  S(  /x.,  +  yf  -  r 


where  x.  and  y.  are  the  inphase  and  quadrature  components 


respectively  and  r.  =  /x.  +  y 


,  =  /x,  + 


f.  (<t>.)  =  o^- 


and  since   V.   and  4.  are  independent 

flvJ,4>,(p''t')  =  f|V«|(p)'V  +  )  • 

1  i|5Ti         '  i '      Yi 

We  desire  the  pdf  of  V.,  which  may  be  expressed  as 

//fivi  ApA)    cipdA  =  //  fv  (x,y)  dxdy 
v$  I v| ,9  XY    l 

Changing  variables  to  x  =  pcos4,  y  =  psin4,  we  have  the 
Jacobian 

|j|  -  |p| 

so  we  now  have 

//    fi,,i     ,(p,4)    dpd4   =    //    pf,r    (pcos<J),    psin4)    dpd<j> 
p$      \V\>V  p$        vi 

from  whence 


yi     ,(p,<f)    =    pfy    (pcoscj),    psin4) 


=    pfy    (x,y) 
i 


and 


fv  (*,y)  =         x  fivi  (/?+y2  I  (2) 


2tt  /x   +y 


«      /"2.     2 
/x 


From  equation  1  we  may  write  the  characteristic  function 
for  the  specular  component  as 


c,U,n)  =  /  / 


j(£x+ny)  A/  2^   2     „.)    ,  , 


_00   _00 


i   /  2  ,  2 
it  /  x  +y 


Again  letting  x  =  pcos<}>,  y  =  psin<r>;  the  Jacobian  is  again 
p  and 

2tt  oo     jp(£cos<j>+nsin<jO(uD_r.  ) 
C,(C,n)   =   /     / P     x      -|p|dpd4» 

1  0    0  27Tp 


The  exponent  may  be  reduced  by  noting 

. / 

/2      2 

£cos<r>   +   rising   =     / £  +ri 


coscf)+    •  sine}) 


/c2.     2  /r2,     ; 


=   /r2.  2  cos(6+a) 
=  c  coscf)' 
where  c  =  A-2 ,  2  and  (J> '  is  again  uniform  on  (0,2tt),  hence, 

oo  -.    TT    .         i  | 

C,U,n)  =  C,(C)  =/  6(p-r.  )•  ±  /   e^pC0S<f)  d((>'dp 
i         i      Q       i      0 


From  Reference  3,  item  9.1.21 


-n   tt 


J  (x) 
n 


(«]  )      r  J  XCOS0     /   oN  ,. 

=  -^^ /   eJ      cos(n0)di 

TT 


hence 


C.(C)  =  /   6(Tr-r.)  J  (£p)dp 
i       0 


(3) 


■  J0(^i} 


2  2 


where  C  =  /^2+^2  . 

Turning  our  attention  to  one  of  the  scatter  components, 
which  is  characterized  by  the  bivariate  gaussian  distribu- 
tion, we  have  the  pdf: 


.,       o.2/2    a.2/2 
fs  (x,y)  =  — izzzzz  e 


2p 


l 

2 


i         °i2 

e 


/2ttct.2 

1 


where  again  p  =  /  2,  2  . 

Prom  Reference  3,  item  26.1.26  the  characteristic  function 


is 


Cs  (C)  =  e    "  .   •  (5) 

i 


where  again  c  =  /  r2+    2  . 


We  now  define  the  scatter  vector  to  be  the  sum  of  all 
the  independent  scatter  components.   Our  motive  for  preced- 
ing to  the  characteristic  functions  becomes  clear  since  the 
characteristic  function  of  a  sum  of  independent  random 
vectors  is  simply  the  product  of  their  characteristic  func- 
tions.  Hence,  we  have  for  the  characteristic  function  of 
the  scatter  component: 
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n 
Cq(c)  =   It   e 
b      1=1 


„  2r2 


=  e 


,2   ?    2 
C    E  a. 

X  i=l  i 


(6) 


=  e 


2     n    p 
where  a   =   £  a.  .   The  characteristic  function  of  this 

i=l  1 

scatter  vector  and  one  specular  component  is  then 


c(c)  =  cs(c)  c1(c) 


2  2 

=    e  "^  J  (tr.) 

o    1 


By  adding  a  total  of  n  specular  components  the  character- 
istic function  becomes: 


2  2 

a  c 


n 


C(C)  =  e 


n  J  (cr. ) 

.  n   o  *  i 

i=l 


(7) 


The  characteristic  function,  as  we  have  developed  it  here, 
is  convenient  for  expressing  only  the  even  moments  since 


2         \  m 

E[p2m]  =  (_i)m[  JL+I   1|    c(e) 


3C 


2    C   9C 


c=o 


The  second  moment  (m  =  1)  is 


2      2     n    2 
E[p*]  =  a  +      Z    r/ 

i=l 
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The  pdf  for  |v|  may  be  expressed  in  terms  of  the  Humbert 
hypergeometric  series  (see  Ref.  2): 

/  r  2     r  2 

f|Vi(p)  =  H  ^2  |i;i  ,ia...,i;-  *§,-  -%-,...  ^V 

1  '      a     I  a    a       a 

Let  us  now  restrict  ourselves  to  digital  systems  where 
we  may  relate  the  signal  pdf  above,  to  the  probability  of 
an  error  $.   We  define  the  probability  of  an  error  as  the 
expected  value,  averaged  over  the  distribution  of  the 
signal  amplitude,  of  a  "probability  of  error  function." 
In  other  words 


6  =  E[p  (p)] 


/   p  (p)  f,v,(p)dp 
0  '  ' 


(9) 


The  'probability  of  error  function'  is  determined  by  the 
modulation/detection  procedure  and  the  noise  environment. 
It  represents  the  probability  of  making  an  error  at  a  par- 
ticular time  as  a  function  of  the  signal  amplitude  and  a 
given  type  of  noise.   Equivalent ly ,  it  may  be  viewed  as 
either  the  probability  of  a  false  alarm  or  the  probability 
of  a  miss  in  detection  theory.   We  consider  only  the  case 
of  noncoherent  frequency-shift-keying  in  stationary,  zero 
mean,  gaussian  noise.   If  the  signal  duration  is  T  ,  the 
receiver  bandwidth  B,  and  the  total  noise  power  is  the 
receiver  pass  band  is  N  ,  then  the  probability  of  making  an 
error  for  a  given  signal  amplitude  is  given  by  Montgomery 
[4]  as: 
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T  Bp2 
s 

2N 

pe(p)  =  h   e     U   .  (10) 

We  note,  in  passing,  that  in  case  of  constant  signal  energy 
where  p  =  /E,  our  definition  of  the  probability  of  an  error 
is  consistent;  since  in  this  case, 

fv(p)  =  5(p-  /E) 

and 

TsBp2 

~"2N7~ 
3  =  /  he  6(p  -  /E)  dp 

0 

TSBE 
=  h   e 

which  is  the  well  known  result,  giving  the  bit  error  prob- 
ability as  a  function  of  signal  energy /noise  energy. 

With  the  signal  duration,  T  ,  receiver  bandwidth,  B, 
and  noise  power  N„,  we  may  utilize  equations  9  and  10 
together  with  the  envelope  pdf  given  in  equation  8  to 
obtain  the  probability  of  an  error.   The  result,  from 
Nesenbergs  [Ref .  2]  ,  is 

Nn       1  I  rl  rn    \ 

"  "  ri  ■  ^^  *2   i;i.i.....ii  -r^'—T^o] (11) 

s  s  s 

where  r.  =  amplitude  of  the  i    specular  component  and 

a2  variance  of  the  scatter  components  of  the  received  signal 
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Nesenbergs  has  developed  bounds  for  this  hypergeometric 
series  in  Reference  2  which  yield: 
2 


o      2N„ 

2  ,        0 

(J            +     rr,        ^ 

1    e                 s 

1 

<     8     <    £■ 

2                     „    2 

—        —  2 

T   Bo 

1     4-        s 
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T  B(r2+a2) 
s 


N 


1  + 


2N 


T  Br 
s 


2N 


0 


T  B(r2+c2) 


1  + 


0 


2N 


0 


2     2 
where  r  =  E  r.  .   Figure  2.3  is  a  plot  of  some  special  cases 

calculated  by  Nesenbergs' in  Ref.  2. 

It  can  be  seen  from  Table  I  that  as  the  frequency  is 

decreased  below  the  MUF,  other  propagation  paths  may  occur, 

resulting  in  the  bit  error  probability  as  described  in 

Equation  11.   While  for  frequencies  well  below  the  MUF,  the 

signal  amplitude  pdf  may  be  very  nearly  Rayleigh,  for 

frequencies  near  the  MUF  the  Rayleigh  approximation  is  not 

appropriate.   One  could  reasonably  expect  the  ratio  of  the 

operating  frequency  to  the  MUF  will  be  roughly  related  to 

the  bit  error  probability.   Increasing  the  ratio  toward 

unity  results  in  fewer  modes  with  a  corresponding  decrease 

in  the  error  probability,  while  a  decrease  in  the  ratio 

indicates  the  possible  existance  of  more  modes  causing  an 

increase  in  the  error  probability.   This  effect  is  experienced 

during  periods  of  the  day,  such  as  sunrise  or  sunset,  when 

the  MUF  is  changing  rapidly,  and  the  carrier  frequency  is 

fixed.   Tsai  [5]  and  McManamon  [6]  have  also  noted  this 

effect  in  more  quantitative  terms. 


o  n 


n=2,r  =r2=— 


i    2      2 
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30 
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Figure  2.3.   Bit  error  probability  3  as  a  function 
of  signal  to  noise  ratio  (db).  (Reprinted 
from  Ref.  2,  p.  82,  with  permission.) 
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There  are,  of  course,  other  sources  of  error  in  the  HF 
channel.   One  such  source,  which  may  have  an  appreciable 
effect  on  the  error  probability,  is  atmospheric  noise.   The 
next  section  deals  with  this  noise  in  some  detail. 

C.   ATMOSPHERIC  NOISE 
1 .   Introduction 

Atmospheric  noise  is  the  name  generally  associated 
with  electromagnetic  energy  which  has  its  source  in  natural 
phenomena  such  as  lightning  discharges.   In  general,  atmos- 
pheric noise  is  impulsive  and  nonstationary .   Geographically, 
the  largest  dynamic  range  occurs  v/ithin  the  tropics,  from 
whence  it  propagates  into  the  northern  and  southern  hemis- 
pheres.  For  any  particular  location  the  received  rms  or 
average  power  will  be  determined  by  local  thunderstorm 
activity,  as  well  as  the  propagation  conditions  to 
the  tropics.   Although  the  largest  dynamic  range  (on  the 
order  of  100  db )  occurs  in  the  lower  portion  of  the  frequency 
spectrum  (ELF-LF),  it  may  be  the  limiting  type  of  noise 
within  a  major  portion  of  the  HF  spectrum.   Statistical 
parameters  such  as  average  power,  and  average  envelope 
voltage  have  been  measured  quite  accurately  and  the  results 
published  in  the  form  of  Technical  Notes  by  the  National 
Bureau  of  Standards,  and  the  International  Radio  Consulta- 
tive Committee  (CCIR  Report  322,  "World  Distribution  and 
Characteristics  of  Atmospheric  Radio  Noise"  [7]). 
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Numerous  models  for  atmospheric  noise  are  contained  in 
the  current  literature  [Refs.  8  and  91.   Most  of  these  models 
are  based  on  an  impulse  model  with  the  impulses  distributed 
according  to  some  particular  rule.   Models  such  as  these 
have  two  principal  difficulties,  first  the  parameters  of 
the  distribution  are  chosen  so  they  match  the  first  order 
statistics  of  the  received  noise   and  have  very  little 
correlation  with  statistics  such  as  pulse  interval  distri- 
bution or  pulse  duration  distribution.   Secondly,  there 
may  be  serious  computational  difficulties.   A  more  recent 
model  was  proposed  by  H.  M.  Hall  [Ref.  10].   The  Hall  model 
seems  to  fit  the  measured  statistics  of  the  noise  and  hence 
will  be  used  for  the  remainder  of  this  section. 

2.   The  Hall  Model 

The  noise  y(t)  is  considered  to  be  the  product  of  a 
zero  mean  narrow  bandpass  gaussian  process  n(t)  with  a 
covariance  function  R  (t),  and  a  slowly  varying  regime  pro- 
cess A(t)  which  is  assumed  to  be  independent  of  n(t)  and 
stationary.   Although  the  actual  atmospheric  noise  is  non- 
stationary,  it  may  be  approximated  by  a  stationary  process 
for  a  reasonably  short  period  of  time.   The  statistics  of 
the  modulating  regime  process  are  chosen  so  the  resulting 
product  y(t)  =  A(t)  •  n(t),  is  an  accurate  description  of 
the  received  noise. 

Summarizing  Hall's  treatment  we  have  the  following  pdf 
for  the  regime  process 


?0 


fa(a)    =   k 


L?a2         a2j 


m+1 


-  6  <    a   <    6         (12) 


(a   chi    distribution  with  parameters   m   and   a) ,    where   k   is 
chosen   so   that 


/     f   (a)da  =    1 
-6      a 


(13) 


The  use  of  a  finite  limit  for  the  amplitude  is  based  on  the 
required  convergence  of  the  integral  for  1  <_  m  <_  2  ,  since 
the  energy  radiated  must  be  finite,  and  to  match  the  observed 
levels  at  the  higher  frequencies  in  the  HF  band. 
The  gaussian  component  is 
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f  (n)  = 
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2Q 


/27TO  2 
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(14) 


The  calculation  of  the  pdf  for  y(t)  is  straight  forward 
[Ref.  11] 

f  (y)  =  f°°   p  (x)  p  y  ^-r- 
y  J  m     a     n  x   |  x| 

Substituting  equations  13  and  41  into  15  results  in 
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e  dx 
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Noting  that  the  integrand  is  an  even  function  and  making  the 

substitution  £  =  — r,  f  (y)  becomes 

x    y 
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or  equivalently 


f„(y)    =   —  •  /      5   2      e     ■  n  d? 
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n      62 
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fv(y)    =  k S+T        /  .A 

/2tto..       /    m    +  _y_ |    2  1  /    m     |      y 
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f         /    m 


2      2a   2I  «2bo2      2a   2I 
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If  we    consider   the    case   where    6    ■>  °°j    then 


k  = 


(g)m/2 

m  „  /mi 
a 


r(|) 

and  equation  16  reduces  to 


^  r(?)(-^-  +  y2J 


f  (y)  =  .  Q K-^— tt  (17) 

y  »   o     .  m+l 


In  order  to  compare  this  mathematical  model  with  the 
measured  atmospheric  noise  we  must  find  the  complement  of 
the  probability  distribution  function  for  the  envelope  and 
phase.   We  let 

y(t)  =  V(t)Cos(w  t  +  <j>)  . 

The  pdf  of  the  envelope  and  phase  may  then  be  obtained  In 
terms  of  the  inphase  and  quadrature  components  of  the 
received  noise  as 
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//  f   (V,<())d<j>dV  =  //  f  ~(y,y)dydy 
V*    ,9  YY   y'y 

where  f   ~(y,y)  is  the  joint  pdf  of  the  inphase  (y)  and 
quadrature  (y)  component  of  the  received  noise.   Changing 

variables  on  the  right  hand  side  of  the  equality  sign  to 

2 
y  =  VCos<J>,  y  =  VSin4>,  yields  the  Jacobian,  J  =  V(Cos  <$>   + 

Sin2<f>)  =  V  and 


//    fv      (V34))d(J)dV  =    //  V   f      ~(V   cos 4),   V   sincj))    dVd<|) 


hence 


f      A(V.cf))    =   V   f      ~(V   cos<}>,   V   sin<J>) 
rs<t>      ,r  y,y 

We  let  n(t)  represent  the  quadrature  component  of  the  narrow 
band  gaussian  noise  and  y(t)  =  A(t)n(t)  and  y(t)  =  A(t)n(t) 
where  f~(n)  =  f  (n)  =  N[0,R  (0)],  and  n(t)  is  Independent 
of  n(t).   We  may  now  express  the  joint  pdf  as: 

oo  ~ 

f   ~(y,y)  =  /   f  (X)  f  (£)  f~(^-)  ~    dx 

y  ,y  a     n  \x/   n\x/   2 
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If  again  we  let  6  -»•  °°,  then 
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The  complement  of  the  desired  distribution  function  is  simply 
the  probability  that  the  noise  envelope  voltage  exceeds  some 
threshold,  V*n ,  i.e. 


P(V  >VQ) 


/  p„(V)dV 
vo 
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V0  +  Y   2 


For  convenience  we  define 


P(VQ) 


P(V  >v0) 
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Figure  2.4  is  a  plot  of  the  envelope  level  (in  db  above  y) 
vs.  -log, n | In  P(Vn) |  for  various  values  of  m.   Figure  2.5 
and  2.6  are  plots  of  actual  measured  atmospheric  noise  at 
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Figure  2.4.   APD's  obtained  from  the  Hall  model 
for  various  values  of  m 
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Figure  2.5.   APD  of  atmospheric  noise  measured  at  Laramie, 
Wyo.  2350-2358  MST  16PEB  1970,  at  a  frequency  of  2.5 
MHz.   A  indicates  measured  values,  the  dashed  line 
is  obtained  from  the  Hall  model  with  m  =  3. 
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Figure  2.6.   APD  of  atmospheric  noise  measured  at  Laramie, 
Wyo.  2335-2343  MST  17  FEB  1970,  at  a  frequency  of 
5.0  MHz.   A  indicates  measured  values,  the  dashed 
line  is  obtained  from  the  Hall  model  with  m  =  4. 
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frequencies  of  2.5  MHz  and  5.0  MHz  respectively.    A  plot 
of  the  form  of  figure  is  an  amplitude  probability  distribu- 
tion (APD)  and  represents  the  percentage  of  time  a  given 
level  is  exceeded  as  a  function  of  the  level.   The  plotting 
paper  is  designed  so  that  thermal,  or  gaussian  noise  plots 
as  a  straight  line  with  slope  -h.      Montgomery,  in  1953 »     [*U 
showed  that,  for  FM  or  FSK  modulation  systems,  the  probabil- 
ity of  a  bit  error  is  one  half  the  probability  that  the 
noise  amplitude  exceeds  the  signal  amplitude.   Hence  if  the 
signal-to-noise  ratio  is  10  db ,  one  has  simply  to  find  the 
percentage  of  time  the  level  corresponding  to  10  db  above  the 
rms  noise  level  is  exceeded.   This  percentage  will  then  be 
twice  the  bit  error  probability.  As  an  aside,  an  approximation 
to  the  APD  may  be  inferred  from  the  data  in  CCIR  Report  322  [71. 

As  in  the  case  of  the  signal  amplitude,  we  have  only 
obtained  an  'average'  bit  error  probability.   There  is  no 
indication  of  the" correlation  between  errors.   Present 
research  (Spaulding,  Disney,  and  Espeland  [12])  indicates 
that  the  burst  interval  distribution  and  burst  duration 
distribution  may  be  matched  as  closely  as  desired  by  proper 
specification  of  the  covariance  function  for  the  modulating 
regime  process  A(t).   While  this  is  an  important  factor,  it 
is  not  within  the  intent  of  the  present  paper,  so  we  will 
simply  give  some  typical  cases  to  indicate  the  character- 
istics of  the  burst  parameters.   Figures  2.7  and  2,8  are 


*The  data  presented  here  was  obtained  at  Laramie,  Wyoming 
during  the  period  16-18  Feb.,  1970  while  the  author  was  in 
guest  worker  status  with  the  Environmental  Science  Services 
Administration. 
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Figure  2.7.   Pulse  Duration  Distribution  of  Measured 
atmospheric  noise.   Recorded  on  4  April  1969 
between  0000  and  0^00  MST  at  Boulder,  Colorado 
at  frequency  5  MHz. 
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Figure  2.8.   Pulse  interval  distribution  of  atmospheric 
noise.   Recorded  on  h   April,  1969  between  0000  and 
0^00  MST  at  Boulder,  Colorado  on  a  frequency  of 
5  MHz. 


plots  of  the  pulse  duration  distribution  and  pulse  interval 
distribution  respectively,  for  atmospheric  noise  at  5.0  MHz. 
For  completeness,  the  APD  for  this  set  of  samples  is  also 
plotted  as  Figure  2.9. 

Since  the  median  signal  energy  to  noise  energy  ratio  is 
nominally  greater  than  20  db  for  the  HF  channel,  we  will 
use  this  figure  to  obtain  estimates  of  the  effect  of  the 
atmospheric  noise  on  the  channel.   It  may  be  seen  from 
Figure  2.9  that  a  20  db  signal  to  noise  ratio  will  corres- 
pond roughly  to  a  bit  error  probability  approximately 
5  x 10   while  Figure  2.7  indicates  that  for  the  pulse 
duration  distribution  only  7%    of  the  pulses  were  greater 
than  .6  msec  in  duration.   (The  20  db  signal  to  noise  ratio 
corresponds  to  level  5.)   A  similar  analysis  of  Figure  2.8 
indicates  that  55$  of  the  intervals  between  bursts  were 
longer  than   60  msec.   For  many  HF  circuits,  particularaly 
where  a  large  signal  to  noise  ratio  exists,  the  atmospheric 
noise  introduces  essentially  random  errors. 

D.   MODELING  THE  HF  CHANNEL 

The  preceding  sections  developed  a  relation  between  the 
bit  error  probabilities  and  the  number  of  multipaths  in 
existance  between  the  transmitter  and  receiver.   The  behavior 
observed  is  not  new  or  unexpected;  it  has  long  been  known 
that  as  few  as  six  equal  amplitude  signals  of  random  phase 
may  be  summed  to  give  a  distribution  which  is  nearly  Rayleigh 
We  must,  however,  bear  in  mind  the  assumptions  utilized  to 
arrive  at  the  results  obtained:  recall  that  we  assumed 
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Figure  2.9.  APD  of  atmospheric  noise  measured  at  Boulder, 
Colorado  0000-  0  400~MST,  4  April  1969,  at  a  fre- 
quency of  5.0  MHz. 
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stationary  and  statistically  independent  specular  and 
scatter  components.   It  should  be  mentioned  that  some  of 
the  multipath  conditions  may  result  from  magneto-ionic 
splitting  and  may  be  correlated.   In  addition  we  did  not 
discuss  the  time  delay  (which  may  be  as  much  as  several 
milliseconds)  nor  the  frequency  distortion  or  doppler.   In 
general  we  are  dealing  with  a  channel  where  there  is  corre- 
lation between  errors ,  and  even  groups  of  errors .   In 
graphical  terms  the  bit  error  probability  may  be  expressed 
as  a  function  of  time  as  in  Fig.  2.10.  Time  periods,  when  the  error 
probability  is  relatively  high,  could  well  result  from  the 
existance  of  multiple  propagation  paths,  while  the  periods 
of  small  error  probability  are  most  likely  due  to  little  or 
no  multipath  and  only  atmospheric  noise  corrupting  the  signal, 
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Figure  2,10.   Error  Probability  as  a  Function  of  Time. 
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Another  factor  which  limits  the  use  of  the  channel  is 
the  differential  time  delay  incurred  by  the  signal  traveling 
different  paths.   This  time  delay  places  a  lower  limit  on 
the  signal  duration  (for  pulsed  systems)  since,  if  the 
signal  duration  is  about  equal  to  the  time  delay  over  one 
of  the  propagation  paths,  it  will  be  received  during  the 
subsequent  signal  time  frame.   This  differential  time  delay 
is  on  the  order  of  5  msec  and  hence  a  lower  limit  on  the 
signal  duration  is  between  5  and  10  msec.   To  make  more 
efficient  use  of  the  allocated  channel  bandwidth,  the  normal 
procedure  is  to  frequency  multiplex  several  channels  onto  a 
common  carrier  and  utilize  a  signal  duration  of  about  10  msec 
on  each  of  the  sub-channels.   Again  propagation  anomilities 
such  as  the  doppler  shift  may  cause  'cross  talk'  between 
adjacent  channels.   This,  in  many  instances,  results  in 
frequency  selective  fading  where  the  fading  is  confined  to 
a  small  section  of  the  transmitted  bandwidth. 

In  order  to  describe  the  channel  in  qualitative  terms 
we  need  to  know  how  the  errors  occur.   The  simple  bit  error 
probability  is  not  sufficient. 

If  the  message  is  to  be  transmitted  as  binary  digits 
suitably  modulating  a  radio  frequency  carrier,  then  a  burst 
of  length  N  is  defined  as  a  sequence  of  N  digits  beginning 
with  a  bit  in  error  that  is  immediately  preceded  by  a  correct 
bit,  and  ending  with  an  error  which  is  followed  by  a  correct 
bit.   The  burst  interval  is  defined  as  the  interval  between 
bursts.   It  begins  with  the  first  correct  digit  following  a 
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burst  and  ends  with  the  last  correct  digit  preceding  the 
next  burst.   A  gap  of  length  m  is  defined  to  be  a  sequence 
of  m  consecutive  error  free  digits  between  two  successive 
errors.   A  gap  may  occur  within  a  burst  or  actually  be  the 
burst  interval. 

It  is  apparent  that  the  sequence  of  received  digits  may 
be  completely  punctuated  by  the  errors  Into  bursts  and  burst 
intervals,  however  this  partitioning  Is  not  unique.   The 
burst  length  and  burst  interval  distributions  are  statistics 
of  the  channel  and  must  either  be  known  or  at  least  a 
reasonable  statistical  estimate  should  be  available.   Since 
the  partitioning  is  not  unique,  the  burst  and  burst  interval 
should  not  be  used  to  characterize  the  channel.   To  avoid 
this  difficulty,  we  return  to  the  gap  distribution.   While 
the  gap  distribution  will  affect  the  burst  length,  it  is 
not  subject  to  interpretation  ambiguities  and  may  be  used 
to  uniquely  determine  a  possible  channel  model. 

There  are  two  primary  methods  of  characterizing  the  HF 
channel.   The  first  consists  of  actually  recording  error 
patterns  and  statistics  for  a  reasonably  extensive  period 
of  time  and  assuming  these  are  representative  of  the  channel 
The  data  may  then  be  utilized  to  determine  the  effectiveness 
of  various  coding  schemes  by  actual  simulation  or  the 
performance  estimated  by  the  coding  parameters.   The  second 
approach  is  to  use,  again,  measured  parameters  to  infer  a 
stationary  homogenous  Markov  chain  model  which  may  then  be 
utilized  to  characterize  the  channel.   The  usefulness  of 


45 


this  method  follows  from  the  properties  of  the  stationary, 
homogeneous  Markov  chain,  since  the  chain  is  completely 
characterized  by  the  matrix  of  one  step  probability  transi- 
tions.  This  approach  is  presented  in  detail  by  Fritchman 
[131 ,  Tsai  [51 ,  and  McManamon  [6] ,  together  with  results 
which  are  in  very  good  agreement  with  measured  data.   The 
aperiodic  Markov  chain  model,  as  described  by  McManamon,  is 
completely  determined  by  the  gap  distribution.   Since  a  gap 
of  length  n  is,  by  definition,  a  sequence  of  n  consecutive 
error  free  digits  which  are  immediately  preceded  and  followed 
by  errors,  the  probability  of  a  gap  of  length  n,  f(n)  may  be 
interpreted  as  the  return  to  an  error  state  in  exactly  n+1 
steps.   Therefore,  f(n)  is  simply  the  first  return  probabil- 
ity distribution  of  the  Markov  chain.   The  mean  return  time 
may  then  be  expressed  as 

oo 

u  =   Z   nf(n) 
n=l 

The  stationary  distribution  of  the  Markov  chain  is  of  con- 
siderable practical  importance.   Denoting  the  probability 
that  the  output  of  the  channel  at  time  j  is  in  error  by  P(j), 
we  may  then  ask  'How  does  P(j)  behave  as  time  progresses?' 
The  answer  is  given  by  the  Ergodic  Theorem  for  Markov  chains 
(Cox  and  Miller  [1^]),  where  the  stationary  distribution  is 
defined  in  terms  of  the  mean  return  time.   The  standard 
definition  for  the  first  return  in  the  Markov  chain  includes 
the  return  state.   Therefore,  in  terms  of  the  gap  distribu- 
tion f (n) ,  as  defined  above, 


l\6 


- 


Lim  P(j)  =  — iy  =  6 


where  3  is  the  average  bit  error  probability.   This  simply 
states  that  after  a  long  period  of  time  the  average  prob- 
ability of  the  output  being  in  error  is  independent  of  the 
initial  distribution  and  is  equal  to  the  inverse  of  the 
average  gap  distribution. 

In  summary,  the  HF  channel  is  neither  the  classical 
random  error  channel  nor  the  typical  burst  channel,  but 
rather  a  hybrid  channel  which  may  contain  may  contain  many 
long  bursts  between  which  occur  random  errors.  For  the 
convenience  of  the  reader,  the  measured  burst  and  burst 
interval  distributions  given  by  Tsai  [51  are  reproduced 
here  as  Figures  2.11  and  2.12  respectively. 

Figure  2.11  may  be  used  to  obtain  the  burst  duration, 
for  example  80  percent  of  the  bursts  are  less  than  20  bits 
in  duration.   Figure  2.12  gives  the  interval  between  bursts, 
and  by  a  similar  examination  it  can  be  seen  that  HO   percent 
of  the  burst  intervals  were  less  than  1200  bits. 
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Figure  2.11.   "Burst  distribution  for  an  HF  channel" 
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Figure  2,12.   "Burst  interval  distribution  for  KF 
channel . 
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III.   CODING 

A.   INTRODUCTION 

In  the  preceding  chapter  the  effect  of  the  channel  on 
the  transmitted  information  was  described.   It  should  be 
noted  that  the  modulation/detection  procedure  assumed  is 
non-coherent  frequency-shift-keying.   This  assumption  was 
made  since  it  is  in  wide  use  and  of  considerable  practical 
importance . 

Before  we  discuss  specific  coding  procedures  let  us  first 
review,  in  a  very  basic  manner,  what  we  want  to  do,  and  what 
is  available.   A  discrete,  or  digital  communication  system 
in  which  the  performance  may  be  described  by  error  rates  and 
probabilities  Is  assumed.   The  first  question  that  comes  to 
mind  is  "Is  there  some  'source'  which  we  could  utilize  in 
the  channel  to  obtain  the  maximum  rate,  R,  of  communication 
with  the  smallest  error  rate?".   The  answer  is  yes;  for  a 
significant  class  of  channels,  Shannon  has  shown  that  a  non- 
zero capacity,  C,  exists  such  that 

C  =  max  R 
[source] 

with  an  arbitrarily  low  error  rate.   In  general  it  is  not 
possible  to  engineer  a  system  to  the  point  where  R  =  C,  and 
the  probability  of  an  error  is  extremely  small.   We  cheese 
to  modify  the  question  and  ask  "For  some  rate  R  (<  C),  can 
we  significantly  reduce  the  probability  of  an  error?".   In 
other  words  we  are  not  really  trying  to  solve  the 
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"optimum"  problem  but  rather  a  problem  of  what  is 
reasonable . 

If  we  assume  for  the  moment  that  we  have  a  return 
channel,  we  have  available  simple  error  detection  tech- 
niques, such  as  ARQ  (Automatic  Repeat  Request)  and  the 
parity  check  schemes.   The  ARQ  systems,  introduced  by 
Van  Durren,  are  frequently  referred  to  as  3-out-of-7,  or 
"constant  ratio  codes,"  where  the  encoded  messages  consist 
of  seven  bits  in  which  three  'ones'  and  four  'zeros'  are 
required  to  form  a  valid  code  word.   Any  single  error, 
and  errors  of  odd  multiplicity  (3,5,  and  7),  will  be 
detected  while  only  half  of  the  even  errors  will  be 
detected.   The  remaining  even  errors  will  then  constitute 
'real'  errors  at  the  output. 

In  the  parity  check  system  a  sequence  of  n-1  information 
carrying  digits  are  appended  by  a  single  digit,  known  as  the 
parity  check  digit,  so  that  the  total  number  of  ones  in  the 
n  sequence  is  even.   Again  at  the  receiver,  the  parity  is 
checked  and  if  this  check  fails,  an  error  has  been  detected. 
It  is  again  apparent  that  this  scheme  will  detect  all  errors 

of  odd  multiplicity.   The  efficiency  is  much  higher  being 

n-1 

,  however,  the  rate  again  suffers  by  the  retransmission. 

Another  alternative,  which  is  more  attractive,  is  a 
system  where  error  correction  is  possible.   In  general,  the 
error  correcting  code  which  will  correct  't'  errors,  con- 
sists of  a  mapping  of  k  information  digits  into  n  channel 
digits  in  such  a  fashion  that  at  least  2t  +  1  channel  errors 


will  be  required  to  change  any  valid  transmitted  word  into 
another  valid  word.   The  decoding  may  be  done  in  many  ways. 
One,  becoming  more  promising  as  the  state  of  the  art 
develops,  is  by  a  table  look-up  from  a  read-only-memory. 
In  this  procedure  the  received  word  is  compared  with  the 
valid  code  words  and  the  one  which  differs  in  the  fewest 
positions  is  then  chosen  as  the  most  likely  one  to  have 
been  sent , 

In  the  above  paragraphs  we  tacitly  assumed  that  any  set 
of  n  channel  digits  were  determined  by  a  unique  set  of  k 
information  digits,  and  conversely  one  set  of  k  information 
digits  uniquely  determined  one  set  of  n  channel  digits; 
this  corresponds  to  block  coding.   Another  form  of  coding 
permits  any  set  of  k  information  digits  to  affect  several 
blocks  of  channel  digits,  and  conversely  any  block  of  channel 
digits  will  be  affected  by  several  blocks  of  information 
digits;  this  corresponds  to  convolutional  (or  recurrent) 
coding. 

If  the  i    sequence  of  n  channel  digits  contains  the 
corresponding  i    sequence  of  information  bits  explicitly, 
the  code  is  said  to  be  systematic.   Systematic  codes  may 
have  a  marked  advantage  over  non-systematic  codes  in  that 
they  may,  of  course,  be  decoded  with  error  correction  by 
those  with  the  decoding  equipment,  while  those  with  only  a 
limited  amount  of  equipment  may ,  with  proper  synchronization, 
simply  regenerate  the  code  from  the  appropriate  information 
digits  and  use  the  result  for  error  detection  alone,  relying 
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on  a  retransmission  to  correct  the  indicated  errors.   Those 
who  have  little  equipment  may,  for  the  price  of  timing, 
have  access  to  the  information  digits  alone. 

In  summarizing  the  above  procedures,  we  must  again  look 
at  the  channel.   If  the  operational  requirements  are  such 
that  a  two  way,  or  return  circuit  is  not  possible  (for 
example  in  a  fleet  broadcast  network)  the  error  correction 
capability  is  highly  desired. 

B.   CONVOLUTIONAL  CODES 

Let  us  begin  by  describing  a  few  items  which  are  very 
basic  to  the  discussion  which  follows.  The  term  encoding 
is  used  to  imply  a  transformation  or  mapping  from  the  set 
of  information  sequences  into  another  set  of  channel 
sequences,  as  illustrated  in  Figure  3.1.  The  code  is  the 
image  of  the  transformation. 


Figure  3.1.   Mapping  provided  by  code. 
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We  restrict  our  attention  to  binary  codes,  where  the  ele- 
ments of  the  information  and  channel  sequences  are  either 
0,  or  1.   If  each  input  sub-block  contains  a  digits,  while 

the  corresponding  channel  sub-block  contains  y   digits,  the 

ex 
code  is  said  to  be  a  rate  —  code  and  designated  as  a 

Y 

"(y,a)  code."   Since  we  are  dealing  with  binary  codes,  the 
addition  we  will  use  is  Modulo  2,  where 

0  +  0  =  1  +  1  =  0 
and 

1  +  0  =  0  +  1=1 

A  convolutional  code  is  said  to  have  memory  span   v,  if 
each  encoded  digit  is  a  linear  combination  of  information 
digits  in  the  present  set,  and  at  most  v  -  1  of  the  preced- 
ing sets  of  information  digits.   It  should  be  noted  in 
passing,  that  if  v  =  1,  the  code  corresponds  to  a  block  code. 

Another  parameter,  which  is  closely  related  to  the 
memory  span,  is  the  actual  constraint  length  (N)  ,  which  is 
defined  for  a  convolutional  code  of  memory  span  v,  as 
N  =  vy .   This  constraint  length  plays  a  role  in  convolutional 
codes  similar  to  the  block  length  in  block  codes. 

For  Convolutional  codes,  we  define  t.  .as  the  j  digit 
of  the  i  set  of  channel  digits,  in  terms  of  message  digits 
mk(l)  ,  as 

v-1   a 

t.  .  =   £    Eg   .(n)m.   (n)       1  <  j  £  Y      (22) 


ij      n         -,  Q3J  l-a 

q=0  n=l 


(Modulo  2) 


F-5 


where  m._  (n)  is  the  n    digit  of  the  (i-q)    set  of  infor- 
mation digits,  and  g   . (n)  is  either  0,  or  1  depending  on 

0.  i  J 

whether  or  not  m.   (n)  is  to  be  included  in  the  j    trans-. 
mitted  digit.   Prom  equation  22,  we  can  see  that  any  partic- 
ular message  digit  can  affect  at  most  v  sets  of  y  transmitted 
digits,  which  prompts  the  definition  of  memory  span. 

Turning  our  attention  to  the  elements  g   .(n),  the 
choice  of  values  for  these  elements  will  clearly  determine 

the  properties  of  the  code.   If  we  define  g   . (n)as: 

Q  >J 

S0,J(n)  ■   J,n  1  U  i« 

g„  ,(n)  =  0  1  <  0  <  a  q  /  0 

4  >  J 

where  6.  .  is  zero  if  j  ^   k,  and  1  if  j  =  k.   The  first  a 

j  ,k  ' 

digits  of  the  i    set  of  channel  digits  will  then  be 
identical  with  the  a  digits  of  the  i    set  of  information 
digits,  in  which  case  the  code  is  said  to  be  systematic 
(or  canonic  systematic).   In  most  of  the  following  we  will 
treat  systematic  codes  only,  since  Massey  [15]  has  shown 
that  any  non-systematic  code  can  be  converted  to  an  equiv- 
alent systematic  code  without  affecting  the  error  correcting 
capability  of  the  code.   In  general  the  constraint  length  of 
the  non-systematic  code  will  be  shorter  than  that  for  the 
equivalent  systematic  code. 

At  this  point  a  short  example  might  help  to  further 
explain  the  procedures. 

Example  1.   A  Systematic  (3,1)  convolutional  code  with 
memory  span  5.   From  equation  22  we  have 
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v-1   a 
t,  ,  =   2    Z   g   ,(n)m,   (n).  (El-1) 

1»j   q=0  n=i  q>J   i  q 

Since  the  desired  code  has  a  =  1 .  we  let  m.   (1)  =  m.   ,  and 

3  l-q        l-q  ' 

g   .  (n)  =  g   .,  we  may  rewrite  equation  El-1  as 
Q  j  J       Q  >  J 

v-1 
t.  .  =   E   g   .m. 

iaJ   q=0  q,j  i-q 

Now,  since  the  memory  span  v  =  5 ,  we  have  v  -  1  =  4  and: 

'1.J  =  g0,jmi  +  gl,jmi-l  +  g2,jml-2  +  g3,jmi-3  +  ^,jmi-4- 

Since  the  code  is  systematic,  we  know 

§0,1  =  1;   and   gq,l  =  0.  • 

Let  us  choose,  somewhat  arbitrarily, 

g0,2  =  gl,2  =  g4,2  =  lm>      g2,2  =  g3,2  =  °       (E1"2) 

and  , 

g0,3  =  g2,3  =  1;   gl,3  =  g3,3  =  g^s3  =  °' 

The  code  now  may  be  written  as 

t .  ,  =  m. 
i,l    l 

t.  0  =  m.  +  m.  ,  +  m.  ,. 
1,2     l     l-l     i-4 

t .  ~  =  m .  +  m .  0 
i,3     i     1-2 

For  this  code  the  actual  constraint  length  is  N  =  yv   =  3x5  =  15. 
A  different  method  of  visualizing  this  encoding  procedure 
was  developed  by  Wyner  and  Ash  [16],  and  consists  in  consider- 
ing g   .  (l),,..,g   .(a)  to  be  a  row  vector  as 

q }  J       q  >  j 


C  tz 


G   .  =  [g   .  (1)  g   . (2)  . . .  g-   . (a)] 

q,j    &q,j    eq,j      &q,j 

and  M._   a  column  vector  of  the  (i-q)    set  of  information 
digits  such  that 

M.    =  [m.   (1)  m.   (2)  ...  m.   (a)]T. 
i-q    l  i-q     i-q        i-q 


Then 


v-1 

t.  .  =   £   G   .M 


i  ,J 


q  =  0 


q,j  i-q 


If  we  now  let  T.  be  the  column  vector  of  the  i    set  of 

i 


transmitted  digits,  we  have 


T.  = 

l 


t.   , 

1}1 

t.   0 

i52 

• 

• 

t. 
1>Y 

v-1,1  • 
fv-l,2  * 


rv-l,y 


Gl,l   G0,l 
Gl,2   G0,2 


G  -,  G  „ 

1,Y    0,y 


M 


r-1 


i-v+l 


i-v+2 


M 


By  dealing  only  with  systematic  codes  we  may  greatly  simp- 
lify the  matrix,  since  gn  .  (n)  =  6.    and  g   '.  (n)  =  0  for 

u»j       J  j^      q,J 

q  j-    0  for  j  between  1  and  a. 


G   t   G   0 
v-1    v-2 


0   '  I 

--  +  - 

G.  ,  G 


0 


M 


M 


i-v  +  l 


i-v+2 


M 


Where  I  is  the  (axa)  identity  matrix,  0  is  the  (a, a)  null 


matrix  and  G.  is  the  (y-a)  x  a  matrix  defined  as 
J 


G.  = 


G.   x1 
j  ,a+l 


G 


j  ,a+2 


G 


J.Y 


^a+l(1)   Sj>a+1(2) 


3j  ,Y 


(1) 


gJ,Y(2) 


gJ,a+l(a) 
gj,a+2(a) 


g  •   (ot) 


(24) 


If  the  number  of  input  message  sequences  is  indefinitely 
long,  we  may  consider  the  transmitted  channel  digits  to  form 
a  semi-infinite  column  vector  T.   Where  T 


T   = 


T. 


v-1 


T 


v 


T 


v+1 


v+2 


I 

0 

0 

Go 

* 

♦ 

0 

I 

0 

G-, 

Gn 

4> 

0 

0 

0 

Gv-2 

Gv-3 

Gv-4 

0 

0 

0 

Gv-1 

Gv-2 

Gv-3 

0 

0 

0 

* 

Gv-1 

Gv-2 

0 

0 

0 

4> 

4> 

G    _i 

0      I 
G1    GQ 


0      0 

4,      <t> 


0      0      10 
G2    G1    Gn    cf> 


0      0 
G q   G„ 


0 

0      I 
G1   G 


0      0      0 
<j>      <J)      (J> 

0       0 


0 


0       0       0       0 


G4    G3 


o    G  -, 


0 

0 

I 
G, 


G 


M, 


+  1 


+  2 


(25.) 


and  (J)  is  a  (y  -a)xa  null  matrix.   For  convenience  we  will 
refer  to  the  semi-infinite  matrix  which  multiplies  the 
column  vector  of  message  digits  as  G  ,  while  the  matrix 


which  generates  only  the  i    set  of  channel  digits  may  be 
defined  from  equation  23  as 


G  = 


0 


0 


.  .  .  |  0    I 


v-1    v-2 


Gn   G 
1  '   o 


(26) 


Although  G^  has  an  infinite  number  of  rows  and  columns,  any 
particular  row  will  have  at  most  av  non-zero  elements  and, 
correspondingly,  any  particular  column  will  have  at  most 
N  =  y^  non-zero  elements. 

Returning  to  example  1,  from  equation  2  3  we  may  write 
the  i    transmitted  sequence  as 


0    0    0    0    1 

T.    = 
l 

10    0    11 

0    0    10    1 

m 


1-k 


in 


i-3 


m 


m 


1-2 


i-1 


m, 


and  by  inspection,  equations  25  and  26  yield 


G  = 


0  0  0  0  1 
10  0  11 
0  0  10  1 


100000000 
100000000 
100000000 
010000000 
110000000 
010000000 
001000000 
011000000 
011000000 
101000000 
000100000 
001100000 
010100000 
000010000 
100110000 
001010000 
000001000 
010011000 
000101000 
000000100 
001001100 
000010100 


The  physical  generation  of  this  code  is  straight  forward 
involving  unit  delay  elements  (shift  registers)  and  modulo 
2  adders  (exclusive  'OR'  gates),  as  shown  in  Figure  3.2. 
The  rectangular  boxes  represent  unit  delay  elements,  while 
the  circles  with  the  cross  represent  the  modulo  2  adders. 


m, 


i,l 


Figure  3.2.   Encoder  for  Rate  •=■  convolutional  code 
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Returning  to  the  general  problem,  when  the  channel 
digits  are  transmitted  through  the  channel,  noise  may- 
corrupt  some  of  them,  causing  the  received  digits  to  differ 
from  the  original  transmitted  digits.   Utilizing  matrix 
notation,  the  i    received  sequence  may  be  expressed  as: 


R 


ri,l 

ri,2 

t 
• 

r . 

= 

t.    , 

t.    ~ 
1,2 

« 

t. 
i, 

+ 

e .    n 
i,l 

e .    0 
i,2 

• 

e . 

i, 

=  T.  +  E. 
l     l 


(27) 


Where  e.  .  is  1  if  an  error  has  occurred  in  the  j    position 
i ,  J 

th 
of  the  i    transmitted  sequence  and  zero  otherwise.   Our 

task  may  simply  be  stated  as  determining  which  element  of 

E.  are  most  likely  to  be  non-zero. 

Since  we  are  dealing  with  systematic  codes,  we  have 

available  estimates  of  the  i    set  of  information  digits, 

namely  m!  .  =  r.  ..   By  utilizing  the  i  '  v  -  1  preceding 
i ,  J     i ,  J 

sets,  we  may  regenerate  the  code  with  a  duplicate  encoder 
and  obtain  an  estimate  of  the  transmitted  parity  digits  as: 


v-1   a 
tj  ,  =  I         Eg   .(k)r, 


1»J    q=o  k=l 


i-q,k 


a  +1  <  j  <  Y 


By  adding  t!  .  to  r.  .  (a+1  <_  j  <  y)    we  obtain  what  is 
i ,  J      i ,  J 

called  the  syndrome  s.  .. 
-* i,J 


6n 


s.  .  =  t!  .  +  r.  . 
i»J     iaJ     i>J 


v-1   a 

s.  .  =   E    E   g   .  (k)  (m.    ,  +  e.    .  ) 

i,J    q=0  k=1  &q,J      i-q,k    i-q5ky 

v-1   a 
+   E    Eg. (k)m.    .  +  e.  . 

q=0  k=l  q'J    x-q'k    X>J 


since 


v-1   a 
t .  .  =   E    E   g   .  ( k )  m .    ,  . 
i,J    q  =  0  k=l   q'J     ^^ 

Since  the  encoder  Is  linear  we  may  further  reduce  this 
syndrome  to 

v-1   a 

s.  .  =   E    E   g   . (k) (m.    ,  +  m.    ,  ) 
i,J    q=0  k=1   q>J      i-Q»k     i-q,ky 

v-1   a 
+   £    E   g   ,(k)  e.    .  +  e,  . 

q  =  0  k=l   q'J      1~q'k     ljJ 

however,  m.    .  +-m.    ,  =  0   Mod  2. 
i-q,k     i-q,k 

Therefore 

v-1   a 

s.  .=  E    Eg   .  ( k ) e .    ,+e.  .   for  a  +  1  <  j  <  y 

i,J   q=0  k=16q5J     i-q,k    i,j  -°    - 

(28) 

For  each  set  of  y  received  digits  we  will  be  able  to  form 

y  -  a  syndromes,  which  depend  only  on  the  errors  in  the 

received  sequence. 

Again  let  us  consider  the  matrix  notation  and  form  a 

column  matrix  of  syndromes.   From  equation  28  we  may  form 

S.  as 
l 


s.  = 

1 


i  ,a+l 

Si,a+2 

i,a+3 


S.,Y 


Gv-1  °  Gv-2  ° 


0  G1  0  GQ  I 


E 


Ji-v+l 
i-v+2 


E. 

l 


(29) 


Where  0  is  a(y-a) x(y-oO  null  matrix,  G.  is  defined  in  equa- 

tion  24 ,  and  I  is  an  (y-a)x(Y-a)  identity  matrix.   We  are 

now  able  to  write  the  entire  set  of  syndromes,  S ,  in  a 

fashion  similar  to  equation  25. 

Y    columns 

0      <j>      .       .       . 


s  = 


v-1 


I       cf> 
0      G 


0 


1*0 


0      Gi         0      G      I      4>      0 


G 


v-1 


V-1 
v-1 


G. 
G. 


G 

G. 


0, 


A, 


1 


E, 


Jv-1 


or 


S    =    A   E    . 

CO 

C.   THRESHOLD  DECODING 

With  these  syndromes  available,  we  turn  our  attention  to 
a  form  of  decoding  originally  introduced  by  Massey  [15] 
called  Threshold  Decoding,  in  which  the  error  decoding 
decision  is  based  on  the  value  of  the  majority  of  the 
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syndromes  corresponding  to  those  which  would  be  ' 


1'  if 


e.  .  were  actually  in  error.   From  equation  30  we  see  the 

1,J*  th 

first  set  of  syndromes  affected  by  E±    (i.e.  the  i    set 

of  error  digits)  is  S. .   Since  the  memory  span  is  v,  we 

know  the  effects  of  errors  in  E1  will  have  to  be  confined 

to  the  syndromes  S±   -  si+v_1»  hence: 


[s.        I 

1 

Go 

1 

si+l 

Gi 

0 

Si+2 



G2 

0 

E. 

l 

• 

• 

• 

_   _ 

. 

• 

• 

, 

• 

• 

Si+v-l 

v-1 

0 

— 

— 

Where  as  before  GR  is  a  (y-a)xa  matrix  whose  elements  are 
either  0  or  1,  i.e.  the  j    column  of  Gk  is 


gk,a+l(J) 
Sk,a+2(j). 

gk,a+3(j) 


gkjY  (J) 

Thus  those  syndromes,  which  will  be  affected  by  the  error 
in  the  jth  position  of  the  error  sequence  E±  will  be  those 

syndromes  for  which  gljk(J)  =  1.  <a+1  i  k  i  Y'  J  -  1   -  J'+V_1) 
In  other  words,  if  e,  .  =  1  and  no  other  errors  occur,  only 
those  syndromes  which  have  gljk(J)  =  1  wil1  be  non-zero. 
The  presence  of  other  errors  within  the  sets  of  v  -  1  error 
sequence  either  prior  to,  or  after  E.  may  alter  some  of 


£  n 


these  syndromes  to  negate  the  effect  of  the  error.   The 
decoding  proceeds  by  examining  those  syndromes  which  cor- 
responds to  e.  .  and  a  'majority  vote'  decides  on  the  value 
of  e.  .  ,  i.e.,  if  more  than  half  of  the  syndromes  are  1 
the  decision  is  that  e.  .  =  1  and  the  corresponding  infor- 
mation  digit  is  changed.   If  the  digit  is  corrected  the 
effect  of  the  error  may  be  removed  from  the  set  of  syndromes 
by  inverting  all  of  those  syndromes  used  in  the  calculation. 

This  mode  of  operation  is  referred  to  as  feedback 
decoding,  and  as  long  as  no  decoding  error  is  made,  the 
decoder  operation  is  superior  to  decoding  without  feedback 
(normally  called  definite  decoding).   On  the  other  hand, 
when  a  decoding  error  is  made,  the  eroneous  correction 
tends  to  propagate.   This  error  propagation  is  one  of  the 
acknowledged  difficulties  of  convolutional  codes.   To  avoid 
error  propagation  we  introduce  the  concept  of  a  self- 
orthogonal  (or  canonic  self-orthogonal)  code  due  to  Robinson 
and  Bernstein  [17] .   A  self-orthogonal  code  is  a  convolu- 
tional code  in  which  any  two  syndromes  which  check  e.  .  have 
at  most  one  error  digit  in  common.   Since  all  the  syndromes 
which  check  e.  .  have  e.  .in  common,  the  definition  requires 
any  other  error  digit  to  be  included  in  at  most  one  syndrome. 
Robinson  [18]  has  shown  that  a  self-orthogonal  code  has 
limited  error  propagation,  and  on  the  other  hand,  may  be 
decoded  without  feedback  with  no  error  propagation  with  only 
a  slight  loss  in  error  correction  capability. 
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To  illustrate  the  threshold  decoding  procedures,  recall 
example  1,  where 


T.  = 
l 


0  0  0  0  1 
10  0  11 
0   0   10   1 


IT; 


m 


171 


m 


i-4 

1-3 
1-2 
1-1 


m. 


By  comparison  with  equation  26  we  have 


G0  " 


:  G 


1 


;  G^ 


;  Go  - 


5  G)i  - 


th 


As  in  equation  11  the  syndromes  which  check  the  i  '  digit 
may  be  seen  from: 


'i,2 
i,3 


S. 


1  +  1,2 

Jl+1,3 
'i+2,2 

Jl+2,3 

'i  +  3,2 


S. 


i  +  3,3 


S. 


1 

1 

0 

1 

0 

1 

1 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

'1.1 


1,2 


1,3 


i+4,2 

'i  +  4,3 


and  we  see  e.  ,  will  effect  the  svndromes  s.  ~:  s.  0: 
1,1  -         i,2'   i,3 

s.,..  n:  s.10  0:  and  s .  .  ,.  n.   With  five  svndromes,  the  major- 
1+1,2'   1+2,3        1+4,2  °         5 

ity  rule  decides  e.  ,  =  1  if  and  only  if  three  or  more  of 
these  syndromes  are  1-   If  this  set  of  five  syndromes  is 
expressed  in  terms  of  the  error  digits,  it  can  be  seen 

that  the  only  digit  in  common  between  any  two  is  e.  -,  ,  hence 

i)1 


£t- 


this  code  is  self-orthogonal.   The  decoder  is  illustrated 
in  Figure  3-3  where  as  before,  the  rectangular  boxes  are 
delay  elements  and  the  circles  are  modulo  2  adders. 

Returning  to  the  general  concept  of  threshold  decoding, 
it  should  be  pointed  out  that  the  encoder  and  decoder  are 
relatively  simple  to  implement.   To  obtain  the  maximum 
benefit  from  threshold  decoding,  the  code  should  be  self- 
orthogonal.   Robinson  and  Bernstein  [17]  have  developed  a 
synthesis  procedure,  utilizing  finite  difference  sets 
whereby  these  self-orthogonal  codes  may  be  constructed. 

Confining  our  attention  to  systematic  codes  of  rate  one 
half,  the  transmitted  digits  assume  the  form 


t .  -,  =  m.  , 
i,l     i,l 


1,2 


v-1 

E   g,  m.  , 
k=0  k  x"k 


or,    in   matrix   notation 


T.    = 

l 


:v-l    gv-2 


0      1 
gl    g0 


m 


i-v+1 


m 


m 


i-2 

i-1 


m, 


After  the  digits  have  been  transmitted  and  received,  we  may 
again  form  the  syndromes,  realizing  for  this  case  we  will 
have  only  one  syndrome  for  each  set  of  transmitted  digits. 
Forming  the  i    syndrome  by  analogy  with  equation  29 
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CD 

O 
O 


£ 

o 

•H 

-p 

rH 
O 
> 

o 
o 

(D 

■p 

ctf 

o 

Cm 

m 
0) 

O 
O 
(D 
Q 


CO 

m 

<D 

bO 
■H 
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si = 


;  i  o  g  0  o 

v-l    °v-2 


gx  0  gQ  1 


i-v+1,1 


i-v+1,2 


i-1,1 


e  . 


i-1,2 
e .  , 

e .  0 
1,2 


We  note  that  only  the  information  error  digits  are  included 
in  the  i    syndrome  calculation  except  for  the  parity  error 
digit  e.  9 ,  hence  for  convenience  we  will  isolate  the 
information  and  parity  error  digits  and  write  instead 

A 


i  +  1 


i+v-1 


Vl  %-2   '"    gl    g0    °       °      ° 

0        Sv-1    •  • 


g2    gl    g0    °       ° 


0         0 


^1^-2  *  *  *  g0 


i-v  +  1,1 


e. 


i-v+2,1 


'i+v-1, 1 


+ 


i-v+1,2 
'i-v+2,2 


i+v-1, 2 


Where  S.  is  a  column  matrix  of  the  syndromes  s.,...,s..   -, 
l  J  . i  '    '    i+v-1 

From  the  definition  of  memory  span,  the  i    message  digit 

can  only  affect  these  syndromes.   Examining  the  v    column 

of  the  matrix,  it  is  obvious  that  e.  ,  will  affect  only 

i,l 

those  syndromes  for  which  g.  =1. 

J 

For  threshold  decoding  we  examine  only  those  syndromes 

for  which  the  v    column  of  A  contains  g.  =1,  and  form  a 

J 

reduced  set  of  syndromes 
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Ri 


Sv_l  gv_2  •  •  •  gx  g0  0  0  0 


ei-v+l,l 


g 


v_l  •  •  •  Sq   ei+v-l,l 


i-v+1,2 


i+v-1,2 


By  stipulating  the  code  to  be  self-orthogonal,  we  are  assured 
that  no  column  (other  than  the  v   )  will  contain  more  than 
one  element. 

If  there  are  a  total -of  L  syndromes  which  check  e.  ,  , 
then  the  majority  logic  decides  e.  -,  =  1,  if  and  only  if  the 

1  i  -L 


numb 


L; 


er  of  non-zero  syndromes  is  greater  than  [p-T  +  1,  where 


[p-T  is  the  integer  part  of  j.   A  relatively  straight  forward 
calculation  shows  there  are  tj-(L-I)  previously  decoded  error 
positions,  p-(L-l)  error  positions  which  have  not  yet  been 
decoded,  L  parity  error  positions,  and  e.  -,  ,  which  will 

■*-  3  ■*• 

2 
effect  the  decoding,  for  a  total  of  L   +1  possible  positions 

for  errors  to  occur.   If  definite  decoding  (i.e.  no  feedback 

correction  to  the  syndromes)  is  used,  we  may  calculate  the 

probability  of  making  a  decoding  error  for  the  binary 

symmetric  channel  (where  the  probability  of  making  an  error 

is  3  independent  of  the  original  value  of  the  transmitted 

digit) . 

The  probability  of  correctly  decoding  e.  -,  is  simply  the 

probability  that  [p-t  or  fewer  errors  occur  in  the  possible 


(L   -  1)  positions,  hence 


(correct  decoding) 


■It 

i=0 


|l21+1)b1(i-p)l2+1-1 


Now 


[2T  /  2   » 
E    L  +1, 

i=o  l  i  ' 


L2  +  l 
g^l-g)^1-1  =  i-  E      (L21+1)3i(l-6)L2  +  1-1 

i=[|t+l 


and  the  probability  that  a  bit  is  decoded  in  error  is  P 


DD 


DD 


=  1  -  P 


L2+l 


(correct  decoding) 


=   Z 


■  L. 


^Vu-B)1*2*1-1 


l-[ff+l 


If  3  is  reasonably  small  this  may  be  approximated  by  the 
first  term  as 


DD 


[L2  +  l 


r— t+i      2 

B  2     d-B)L  +1  - 


[|t  +  l| 


(33) 


The  decoding  error  probability  for  feedback  decoding, 
on  the  other  hand,  is  more  difficult  to  analyze.   The  bit 
under  consideration  will  be  decoded  correctly  if  i  previous- 
ly decoded  bits  were  in  error,  provided  that  fewer  than 
[p-t  -  i  errors  occur  in  the  bits  which  have  not  yet  been 
decoded  and  the  L  syndrome  error  positions.   Hence  the 
probability  that  a  bit  is  decoded  correctly  is 


(correct  decoding) 


v   P  L 

.  n      (undecoded  errors  <  -~    -   i 
1  =  0  —  2 


decoding  errors) -P(.  decodlng  errors) 

(3*) 
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L 


For  the  conditional  probability  there  are  p-(L+l)  total 
undecoded  error  positions ,    hence,  assuming  independent 


errors 


P  L 

(undecoded  errors  <_   75-  -  i  i  decoding  errors) 


[gt-i/L(L+l) 
=   E 


L(L+1) 


3j(l-6) 


-  3 


(35) 


Designating  the  probability  of  a  decoding  error  as  P„D  (for 
feedback  decoding)  and  assuming  again  independent  errors 
within  the  p-(L-l)  previously  decoded  positions , 


(i  decoding  errors) 


L(I     l)\    .  ^(L-l)-l 

|PFD(1-PPD} 


2 
i 


(36) 


Therefore , 


(correct    decoding) 


[|t    [-t-l/LO+1 
Z         Z 

i=o  3=0 


L(L-l) 


'BJ(1- 


) 


-3 


L(L-l) 
2 

i 


L(L-l) 


P1    (1-P       ) 
FD^         FD; 


-  1 


Noting   that    1-P 


reduced   to 


(correct    decoding) 


PFD,    this   may   be 


L2J 

1   =      Z 

i=0 


[|t-l 


Z 

j  =  o 


L(L+1) 
l3J'(l-6)        2 


-  3 


rL(L-l) 
2 


FD 


L(L-l) 


-  i-1 


^    -    PFD} 


(37) 
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Although  rather  tedious,  by  hand,  the  solution  may  be 
obtained  readily  on  a  computer.   Obvious  approximations  may 
be  made,  when  B  is  sufficiently  small  so  that  P   «  1,  and. 
P   <  P    a  very  reasonable  approximation  is  obtained  by 
assuming  P^D  *  0,  and  solving  the  quadratic  equation  which 
results . 


D.   OTHER  DECODING  METHODS 
1.   Sequential  Decoding 

Another  algorithm  for  decoding  convolutional  codes 
is  sequential  decoding,  which  was  introduced  by  Wozencraft 
[19],  and  differs  from  threshold  decoding  in  that  it  is 
probabilistic  in  nature.   The  encoder  for  a  (y,a)  code  is 

viewed  as  a  directed  graph  or  tree  composed  of  nodes  and 

ex 
branches.   For  a  binary  (y»oO  code,  from  each  node  2 

branches  emanate,  and  to  each  branch  is  assigned  a  sequence 

of  y  binary  digits.   The  decoder  has  available  or  is  able 

to  calculate  the  channel  probability  characteristics,  and 

a  copy  of  the  tree  which  the  encoder  is  utilizing.   When 

the  first  few  sequences  are  received  the  decoder  proceeds 

to  hypothesize  which  sequences  were  'most  probably'  sent 

and  moves  one  node  farther  into  the  tree.   This  process 

continues  until  errors  occur,  at  which  time  the  decoder  may 

get  off  onto  the  wrong  branch.   When  this  occurs  it  is 

forced  to  make  additional  errors,  since  the  sequences  on 

the  incorrect  path  have  no  relation  to  the  sequences 

actually  transmitted.   The  decoder  will  eventually  realize 
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that  something  is  wrong,  at  which  time  it  back-tracks  to  a 
previously  visited  node  and  tries  another  branch.   If  none 
of  these  branches  are  better,  it  again  has  to  back-track  to 
yet  another  previous  node  and  search  the  branches  eminating 
from  that  node.   All  of  these  operations  take  time,  in 
addition  to  the  time  required  to  calculate  the  channel  prob- 
ability characteristics,  when  this  must  be  done.   If  there 
is  only  a  finite  amount  of  buffer  storage  available  for 
the  incoming  data,  a  long  search  might  not  be  completed 
before  the  storage  registers  are  filled,  in  which  case  a 
buffer  overflow  occurs  and  the  system  must  be  resynchronized 

Sequential  decoding  may  be  done  on  any  convolutional 
code  and,  if  the  constraint  length  is  very,  very  long,  the 
decoder  will  eventually  decode  the  transmitted  sequences 
correctly  with  probability  approaching  1. 

For  illustration  purposes  consider  a  simple  (3,1) 
systematic  convolutional  code  of  memory  span  3.   Figure  3.^ 
is  the  tree  graph  of  the  desired  code. 

At  each  node  are  two  branches  corresponding  to  the  two 
possible  values  of  the  information  digit  (i.e.  0  or  1) ,  a 
'0'  implies  the  upper  branch  while  a  '!'  implies  the  lower 
branch.   This  code  was  constructed  by  utilizing  the  proce- 
dures developed  by  Lin  and  Lyne  [20]. 

The  first  input  sub-block  (in  this  case  a  single  digit) 
determines  which  of  two  output  sub-blocks  will  be  trans- 
mitted, i.e.,  1  ->  111;  0  ->  000.   The  second  output  sequence 
will  depend  on  both  the  first  and  second  input  sequences. 
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Figure  3.4.   Tree  graph  at 
convolutional  code. 


If  the  first  input  was  '1'  and  the  second  '0',  then  the 
second  output  sequence  would  be  001.   The  third  output 
sequence,  in  a  like  fashion,  will  depend  on  the  first 
two  inputs  and  the  third  input.   The  fourth  output  sequence 
is  dependent  on  the  fourth  input  and  the  preceding  two 
inputs  -  and  so  the  process  continues.   As  an.  example ■ the 
imput  sequence  10110  would  be  mapped  into  111  001  100  110 
011.   The  encoder  for  this  code  may  be  implemented  with 
unit  delay  elements  and  modulo  2  adders  as  indicated  in 

Figure  3.5. 

In  the  decoding  procedure,  the  receiver  examines  three 
sets  of  sequences,  then  outputs  the  best  estimate  of  the 
first  digit.   For  example,  assume  the  transmitted  sequence 
were  111  001  101  111  corresponding  to  an  input  10  11, 
while  the  received  sequences  were  100  001  101  110  011. 


1U 


»  i,3 


Figure  3-5.   Encoder  for  convolutional  tree 
code  shown  in  Figure  3.^ 


The  decoder  'knows'  that  the  first  sequence  was  either 
111  or  000,  between  these  two  000  is  the  more  likely 
(assuming  a  memoryless  binary  symmetric  channel  where  one 
error  is  more  likely  than  two). 

Therefore,  the  initial  hypothesis  is  that  the  first 
digit  of  the  transmitted  sequence  is  '0',  the  decoder  then 
proceeds  to  the  ne~xt  node,  again  the  two  choices  are 
000,  and  111,  of  these  000  is  again  the  most  logical  since 
only  one  error  is  required  to  transform  000  into  001. 
Going  on  to  the  third  node  the  decoder  chooses  111  again 
with  one  error.   At  this  point  the  total  number  of  errors 
required  to  transform  the  transmitted  sequence  into  the 
received  sequence  is  three.   The  decoder  calculates  at 
this  point  (as  at  the  other  nodes)  the  set  of  channel 
probability  characteristics  and  decides  that  perhaps  three 
errors  in  the  nine  digits  is  excessive  in  which  case  it 
looks  at  the  other  branches  from  the  third  node.   The  other 
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branch  is  quickly  ruled  out  since  four  errors  would  be 
indicated.   The  decoder  then  goes  back  to  the  second  node 
and  looks  at  the  other  branch,  where  again  the  number  of 
errors  would  be  three,  so  the  decoder  continues  back  to 
the  first  node.   Again  the  first  node  lower  branch  indi- 
cates two  errors,  however,  the  decoder  goes  along  this 
path  to  the  second  node,  where  the  upper  branch  has  no 
indicated  errors.   Going  farther  into  the  tree  to  the 
third  node  there  are  again  no  additional  errors  hence, 
the  decoder  alters  its  initial  hypothesis  and  assumes  the 
first  bit  to  be  '1'.   Proceding  into  the  fourth  node  there 
are  no  indicated  errors  within  the  constraint  length,  so 
the  decoder  hypothesis  is  the  second  digit  is  '0'.   And 
so  the  process  continues,  backtracking  when  the  number  of 
of  errors  becomes  excessive. 

2 .   Viterbi  Decoding  Scheme 

Another  method  of  decoding  convolutional  codes  is 
the  Viterbi  algorithm  [21] .   Omura  [22]  has  shown  that  this 
decoding  scheme  may  be  viewed  as  a  forward  dynamic  program- 
ming solution  to  a  generalized  regulator  problem.   The  state 
of  the  encoder  at  time  k+1  is  dependent  on  the  state  at 
time  k  and  the  k   message  input,  hence  one  may  write  for 
the  state  at  time  k+1 

Vi  = A  xk  +  B  Mk 

The  output  of  the  encoder  at  time  k  is  some  function  of  the 
state  of  the  encoder  at  time  k,  hence  the  output  may  be 


7£ 


denoted  as  T.  =  Gk(Xk) .   Again  transmitting  the  encoder 
output  through  the  channel,  the  received  sequence  will  be 
r   =  T,  +  E,  .   Defining  a  performance  measure  J(T)  as 

K.       K.       r>- 

L 
J(T)  =  1  logP(R|T)  =  -  I      logP(Rk|T  ) 

k=0 

the  decoding  proceeds  by  defining  intermediate  costs  as 


fL-k(Xk}  =  min  L'1 


L 

S     logP(R.|T.) 

U-k+1      J  J 


for  each  k  =  0,1,...,L-1,  and  defining  fQ(XL)  =  0.   Since 
the  decoder  follows  the  path  with  the  minimum  cost  an  error 
will  occur  if  and  only  if  the  minimum  cost  is  incurred  on 
some  path  other  than  the  correct  path. 

E.   APPLICATION  OF  CODING  TO  CHANNEL 

The  maximum  expected  differential  time  delay  of  5  msec 
for  different  propagation  paths  requires  the  signal  to  have 
a  duration  of  about  10  msec,  which  in  turn  implies  a  maximum 
data  rate  of  about  100  bits  per  second.   For  convenience,  we 
shall  assume  that  the  data  rate  to  be  used  is  75  bits  per 
second,  which  corresponds  to  the  standard  100  word  per  minute 
teletype.   In  addition,  we  shall  use  a  rate  h   code,  and, 
since  the  encoder  outputs  two  digits  for  each  input  digit, 
the  modulation  method  will  be  frequency  division  multi- 
plexing the  information  digits  onto  one  75  bit  per  second 
subchannel  and  the  parity  digits  onto  another.   A  block 
diagram  for  the  system  is  given  in  Figure  3.6, 
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The  distributions  given  by  Tsai  [5]  were  obtained 
utilizing  a  channel  data  rate  of  75  bits  per  second,  and 
indicate  that  with  probability  0.97,  the  bursts  are  less 
than  50  bits  in  duration.   While  it  is  not  impossible  to 
implement  a  burst  error  correcting  code  to  decode  bursts 
of  length  50,  the  burst  correcting  codes  all  require  a 
minimum  guard  space  of  error  free  digits.   This  error  free 
guard  space  is  not  generally  found  in  the  HF  channel. 

Since  burst  correcting  codes  are  not  desirable,  the 
alternative  is  to  modify  the  distribution  of  the  bursts. 
This  burst  modification  may  be  performed  by  a  process 
known  as  interleaving.   Interleaving  is  simply  a  time 
ordering  of  the  transmitted  digits  so  that  adjacent  errors 
in  the  channel  are  spread  out  in  the  received  sequence 
prior  to  decoding.   (Some  forms  of  interleaving  may  be 
viewed  as  time  multiplexing.)   Interleaving  by  k  bits  may 
be  accomplished  by  many  methods,  however,  one  of  the 
simplest  is  to  replace  each  delay  element  in  both  the 
encoder  and  decoder  by  a  k  bit  shift  register.   This  has 
the  effect  of  inserting  k  columns  of  zeros  between  the 
columns  in  the  generator  matrix,  Gra,  of  the  code,  and 
correspondingly  k  rows  of  zeros  between  each  of  the  rows 
in  the  syndrome  matrix,  A^.   Any  burst  of  errors  that  is  k 
digits  or  shorter  in  length  will  not  affect  more  than  one 
of  the  digits  in  each  of  k  succeeding  decoding  steps.   On 
the  other  hand,  there  may  be  errors  from  more  than  one 
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burst  included  in  the  decoding  calculation.   The  inter- 
leaving cannot  alter  the  average  bit  error  probability, 
all  it  does  is  rearrange  the  errors  so  they  occur  in  a 
more  evenly  distributed  fashion.   For  the  data  presented 
by  Tsai,  interleaving  by  50  bits  would  then  insure  that 
fewer  than  two  error  digits  from  a  particular  burst  would 
be  contained  in  the  digits  utilized   for  any  decoding 
operation  with  probability  0.97. 

The  amount  of  interleaving  required  is  dependent  on 
the  duration  and  frequency  of  the  bursts,  and  the  constraint 
length  and  correction  capabilities  of  the  code  being  used. 
As  the  amount  of  interleaving  is  increased  the  channel  will 
appear  more  and  more  random,  however,  it  must  be  noted  that 
the  interleaved  data  will  be  extremely  susceptible  to 
periodic  interference  unless  care  is  used  in  choosing  the 
interleaving.   (For  example  &h   bit  interleaving  would  not 
be  desirable  for  use  with  a  modulator  utilizing  32  sub- 
channels, since  occasional  modulator  synchronization  errors 
would  be  related  to  the  interleaving.   This  would  have  a 
much  more  serious  effect  on  the  proper  decoding  than  if  the 
interleaving  were  chosen  to  be  63  or  65  bits.) 

When  attempting  to  ascertain  the  degree  of  interleaving 
required,  actual  channel  characteristics  or  measured  error 
patterns  are  utilized.   For  a  given  set  of  received  digits, 
varying  amounts  of  interleaving  may  be  tried  to  determine 
when  the  errors  are  "random  enough."   To  determine  what 
"random  enough"  implies,  consider  a  set  of  data  for  which 


the  average  bit  error  probability  is  •$ .   If  the  constraint 

length  of  the  code  is  N  and  the  code  is  interleaved  by  I 

bits,  then  the  effective  constraint  length  will  be  N'l,  and 

the  decoding  will  be  performed  by  examining  this  effectivs 

constraint  length.   Although  the  total  number  of  errors  is 

still  the  same,  the  ratio  of  the  number  of  errors  per  block 

approaches  B,  since,  by  the  law  of  large  numbers,  the 

probability  of  having  more  than  6  NI  errors  in  NI  digits 

approaches  zero  as  NI  increases  indefinitely.   In  general 

this  would  imply  an  excessively  large  amount  of  interleaving, 

since  the  trend  may  be  rather  slow. 

Another  criterion  is  available  to  determine  a  reasonable 

estimate  for  the  amount  of  interleaving.   For  any  given 

code,  the  ratio  of  the  number  of  errors  correctable  to  the 

number  of  possible  error  positions  may  be  obtained.   For 

the  definite  decoding  procedure  of  the  preceding  section 

this  is   p  ■   .   The  decoder  will  function  properly  (or  at 

L  +1 
least  with  some  improvement)  until  the  ratio  of  the  number 

of  errors  to  the  number  of  possible  error  positions  reaches 
Hence  to  determine  a  practical  interleaving  degree, 


p 

L  +1 


choose  I  so  that  the  ratio  of  the  number  of  blocks  of 

length  NI  with  more  than  —5 NI  errors,  to  the  total  number 

IT+1 
of  blocks  Is  approximately  equal  to  the  probability  of  a 

decoding  error  for  a  completely  random  channel  with 

the  error  probability  6.   For  convenience  we  will  use  the 

definite  decoding  error  probability  PDD-   The  term 
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approximately  above  is  used  loosely  since  the  effectiveness 
of  the  interleaving  is  extremely  sensitive  to  the  actual 
bursts  received. 

To  illustrate  the  procedure,  consider  a  particular  code 
capable  of  correcting  two  errors,  with  constraint  length  18 
as  given  by : 


T.  = 
1 


t.  , 
ti,2 


000000001 
100010110 


mi-8 
mi-7 


m, 


Interleaving  by  I  bits  will  simply  insert  I  columns  of  zeros 
between  each  of  the  columns  of  the  generator  matrix  above, 
and  replace  m.  .  by  m.  ,T,   The  reduced  set  of  syndromes  on 
which  the  threshold  decoding  is  based  are: 


Ri 


(i  +  D 
(1+21) 
(i+4l) 
(1+81) 


100010110000000 
010001011000000 
000100010110000 
000000010001011 


(1-81), 1 


1,1 


e(i+ll),2 
e(i  +  2I)  ,2 
e(i  +  4l)  ,2 
e(i+8l),2 


(1+81), 1 

Again  I  rows  of  zeros  are  to  be  inserted  between  each  of  the 
rows  of  the  multiplying  matrix  above.   For  this  code  there 
are  a  total  of  17  possible  error  positions  within  which  2 
errors  may  be  corrected  hence  the  ratio  of  correctable  errors 
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p 
to  total  error  positions  is  yy   Prom  equation  33,  the  de- 
coding error  probability,  PDD>  may  be  approximated  as 

PDD(6)  -(17)  B3(1.B)1'. 

=  68033  . 

We  now  turn  our  attention  to  the  actual  error  data,  as 
contained  in  reference  23.   We  will  consider  three  typical 

cases  of  measured  error  statistics  where  the  average  error 

_2 
rate  6  is  about  2.5x10   .   This  implies  that  PDD  is  about 

5x10   .   Interleaving  by  I  =  27  bits  results  in  the  number 

2 
of  blocks  with  more  than  yf   (l8)(27)  equal  to  95  out  of  a 

total  of  63675  blocks  for  a  'residual'  uncorrectable  error 
rate  of  1.5x10   .   Interleaving  by  5^  bits  lowers  this  to 
about  1.2x10   ,  while  interleaving  by  10 ^4  bits  only  brings 
it  down  to  7-86x10"  .   Continuing  to  increase  the  inter- 
leaving to  the  point  where  I  =  855  only  changes  the  residual 
error  rate  to  5.9x10   .   This  is  presented  in  tabulation 
form  relative  to  NI  in  Table  III.   Again  it  should  be 
mentioned  that  the  effectiveness  of  the  interleaving  is 
strongly  dependent  on  the  actual  characteristics  of  the 
bursts.   (This  information  is  not  available  for  the  data 
presented.)   It  appears  as  though  a  conservative  amount  of 
interleaving  would  be  5^  bits  for  the  code  under  considera- 
tion, since  for  this  value,  the  code  would  correct  many  of 
the  channel  errors,  and  still  be  reasonably  practical. 
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Although  the  code  presented  here  is  not  a  very  powerful 
one,  as  compared  with  some  of  the  block  codes  or  longer 
convolutional  codes,  the  ratio  of  correctable  errors  to 
total  possible  errors  is  nearly  as  good  as  the  Golay  block 
code  -  which  is  capable  of  correcting  3  errors  -  (jy  as 
compared  with  ^-)  .   In  addition,  the  simple  syndrome 
decoding  procedure  is  flexible  enough  so  that  by  monitoring 
the  syndrome  register,  the  mode  of  decoding  (definite,  or 
feedback)  could  be  determined  -  choosing  feedback  decoding 
during  periods  when  there  are  relatively  few  errors  indi- 
cated in  the  syndrome  register  and  switching  to  definite 
decoding  when  the  number  of  indicated  errors  could  imply 
possible  error  propagation.   This  would  give  a  higher  out- 
put error  probability  during  definite  decoding,  however 
there  would  be  no  error  propagation. 

For  the  feedback  decoding,  and  reasonably  small  3, 

3 
P    is  at  least  as  small  as  PDD,  and  we  assume  PpD  -   0. 

From  equation  35 

P( channel  errors  £  2 | 0  decoding  errors )-l  -[  3  J  P  =1-1653 

P(channel  errors<l|l  decoding  error)  1  -  2  3  (1-3)  =1-556  +  99^ 

1  ~\  2      3 

P(channel  errors=0 | 2  decoding  error) (1-3)   =1-13+556  -1653 

and  from  equation  36: 

6  2 

P(0  decoding  errors)  =  (1-Ppr))   =  1  -  6PpD  +  15PpD 

5  2 

P(l  decoding  error)  =  6PFr)(  l-PpD)   =  6PpD  -  30PpD 


2 

P(2  decoding  errors)  =  15P 


FD 


8^ 


Substituting  these  expressions  into  equation  38  and  solving 
the  resulting  quadratic  equation  yields 


P      1  +  330B2-15W3 
FD  30(-ll$+l65B2-5l8B3 


L  _  2   l65B3-(llB+l65B2  +  51gB3)3'c\ 
^       (1+330B2-1584B3)2     J 


By  expanding  the  square  root  and  retaining  only  the  first 

4  2 
two  terms  (since  the  third  term  involves  ((B  )  )>PFD  becomes 

1+330B2. 1584S3    \/,  J,    l65B3x30(-llS+l65g2  +  5l8B3y 


1  ±{1  2    ie Qho3,2 


FD  ^30(-llB  +  l65B2-5l8B3-)j\    \  (1+3308^-158^) 

The  minus  sign  is  to  be  used,  since  the  positive  sign  results 
in  a  probability  greater  than  1  for  small  values  of  B;  and 
PTrn  may  be  simplified  to 

„  l65sl_ 
rFD 


1+330B2 


A  comparison  of  the  decoding  error  probabilities  is  given 
in  Table  II. 


TABLE  II 

6 

p 
rDD 

p 
FD 

io-2 

6.80x10 

1.65x10 

ID"3 

6.80xl0~7 

1.65xl0~7 

IO"" 

6.80xl0"10 

1.65xlO~10 

It  must  be  noted  that  these  calculations  are  for  a  com- 
pletely random  channel,  however  by  interleaving  by  5^  or 


85 


more  bits  they  should  be  within  an  order  of  magnitude  of 
the  observed  statistics,  again  depending  to  a  large  extent 
on  the  actual  distribution  of  errors. 

The  discussion  of  effectiveness  of  coding  must  include 
some  mention  of  block  coding  performance.   One  of  the  better 
block  codes  is  the  modified  Golay  [24,  12]  code,  which  will 
correct  3  errors  within  any  24  bit  code  word.   For  reasonably 
small  6  the  probability  of  a  decoding  error  P^p  is  approx- 
imately given  as 


p    -   2* 


^ 


a  106263^ 

The  error  probability  Ppr,  is  not  a  bit  error  probability, 
but  the  probability  for  a  2k   bit  word,  or  equivalently  a 
12  bit  information  word.   To  be  comparable  with  the  bit 
error  probability  obtained  for  the  convolutional  code,  this 
word  error  probability  must  be  converted  to  a  bit  error 
probability  [Ref .  23] .   For  the  degrees  of  interleaving 
utilized  for  the  convolutional  code  these  bit  error  prob- 
abilities are  listed  in  Table  III. 

As  can  be  seen  from  Table  III,  the  codes  are  essentially 
comparable  with  the  Golay  code  being  slightly  superior  in 
performance,  while  the  convolutional  code  may  be  simpler  to 
implement  in  hardware. 

The  implementation  of  the  code  is  straight  forward,  the 
encoder  is  shown  In  Figure  3-7  and  the  decoder  in  Figure  3.?. 
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TABLE  III 

A  COMPARISON  OF  BIT  ERROR  PROBABILITIES  FOR 
CONVOLUTIONAL  CODE  AND  MODIFIED  GOLAY  CODE, 
CHANNEL  ERROR  PROBABILITY,  IS  APPROXIMATELY  2x10  d 


Decoded  Bit  Decoded  Bit 

NI     Error  Probability      Error  Probability 
Convolutional  Code    Modified  Golay  Code 

-4 
5.3x10 

3.4xl0_Z| 

-k 
2.3x10 

-4 
1.1x10 


480 

1.5xl0"3 

960 

1.2xl0~3 

1920 

7.86xlO~4 

15360 

5.9x10 

In  this  case,  each  rectangular  box  represents  a  5^  stage 
shift  register.   The  mode  selector  examines  the  general 
state  of  the  syndrome  shift  registers   and  if  there  are 
numerous  indicated  errors  the  mode  is  changed  from  feed- 
back decoding  to  definite  decoding,  the  decoder  remains 
in  the  definite  decoding  mode  until  such  time  as  there  are 
relatively  few  indicated  errors  when  the  decoder  is  returned 
to  the  feedback  mode. 

F.   CONCLUSIONS 

The  improvement  of  the  reliability  of  the  HF  communica- 
tion channel  through  the  use  of  error  correction  coding  has 
been  the  major  concern  of  this  thesis.   The  average  bit 
error  probability  is  determined  by  the  number  and  duration 
of  fades,  and  atmospheric  noise.   The  fading  is  assumed  to 
be  an  interference  phenomena  resulting  from  the  existence 
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of  multiple  propagation  paths,  and  lasting  for  periods  of 

approximately  one  second.   The  atmospheric  noise  contributes 

random  errors  to  the  normally  burst  characteristics  produced 

by  the  fades. 

It  is  demonstrated  that  simple  convolutional  codes 
i 
utilized  with  moderate  amounts  of  interleaving  can  obtain 

approximately  an  order  of  magnitude  reduction  in  the  output 
bit  error  probability.   In  addition,  the  relatively  simple 
decoding  scheme  may  be  modified  during  reception  to  elimin- 
ate the  possibility  of  error  propagation,  with  only  a 
relatively  minor  loss  in  the  error  correction  capability  of 
the  code . 

In  order  to  fully  utilize  the  channel  other  procedures 
should  be  followed,  namely  space  diversity,  where  two  or 
more  separate  receivers  are  utilized  with  antennas  in 
physically  different  locations.   This  decreases  the  effect 
of  fading  resulting  from  the  multipath  conditions  since 
the  fading  at  different  locations  is  generally  uncorrelated . 
In  addition,  frequency  diversity  (when  possible)  should  be 
employed,  since  the  fades  are  unrelated  for  carrier  frequen- 
cies separated  by  more  than  50  KHz. 

Since  the  normal  allocated  bandwidth  in  the  HF  spectrum 
is  3  KHz,  many  separate  channels  may  be  frequency  division 
multiplexed  onto  the  same  carrier  to  achieve  a  higher  data 
rate  on  the  channel.   Existing  frequency  division  multiplex 
equipments  have  a  32  channel  capability,  with  the  sub- 
channels each  operating  at  a  75  bit  per  second  rate  for  a 
maximum  data  rate  of  2400  bits  per  second.   The  error 

cm 


correction  coding  will  require  a  sacrifice  in  the  data  rate 
on  the  channel  to  provide  a  reduction  in  the  output  error 
rate.   By  using  a  rate  %   code  the  information  rate  through  . 
the  channel  will  be  reduced  to  1200  bits  per  second,  16 
subchannels  may  then  be  used  for  the  information  bits  and 
the  remaining  16  subchannels  for  the  parity  bits.   One 
method  of  implementing  such  a  system  indicated  in  Figure 
3.9. 

G .   RECOMMENDATIONS  FOR  FUTURE  RESEARCH 

There  are  three  primary  areas  where  considerable  research 
could  be  applied:  these  are  1)  the  characterization  of  the 
HF  channel,  2)  the  modeling  of  man-made  noise  and  inter- 
ference, and  3)  the  decoding  of  convolutional  codes. 

In  the  characterization  of  the  HF  channel  the  use  of  the 
Markov  chain  model  is  most  promising,  however,  the  principal 
source  of  difficulty  is  the  non-availability  of  data.   The 
data  analyzed  has  been  obtained  from  actual  channel  measure- 
ments which  may  have  been  perturbed  by  categroically  diffrent 
propagation  conditions.   To  avoid  these  interpretations,  it 
is  recommended  that  existing  ionospheric  channel  simulators 
(24)  be  used  to  obtain  characteristics  which  are  dependent 
only  on  multipath  phenomena.   Actual  channel  data  could  then 
be  compared  with  that  from  the  simulator  to  determine  other 
sources  of  perturbation. 

In  the  area  of  impulsive  man-made  noise  or  signal  inter- 
ference maximum  use  could  be  made  of  the  Hall  model.   The 
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Hall  model  was  discussed  in  this  thesis  only  for  the  ampli- 
tude characteristics  of  atmospheric  noise.   However,  its 
principal  advantage  lies  in  the  fact  that  statistics  such 
as  pulse  interval  and  pulse  duration  may  be  correctly 
obtained  by  proper  specification  of  the  covariance  function 
of  the  modulating  process,  A(t),  as  described  on  page  28. 
This  would  be  a  valuable  tool  in  the  design  of  receivers 
for  use  in  areas  where  unavoidable  noise  sources  (such  as 
powerline  transients)  cause  excessive  error  rates  with 
conventional  receivers. 

One  of  the  most  important  fields  for  additional  research 
is  in  the  area  of  decoding  algorithms  for  convolutional  codes. 
In  general,  there  are  some  codes  with  desirable  distance 
parameters,  but  good  general  decoding  algorithms  are  not 
available.   Threshold  decoding,  while  relatively  easy  to 
implement,  Is  not  applicable  to  all  codes.   Sequential 
decoding,  on  the  5ther  hand,  will  work  with  any  convolutional 
code,  but  it  will  not  work  well  on  channels  where  there  is 
correlation  between  errors,  i.e.,  burst  channels.   While  the 
Viterbi  decoding  algorithm  is  a  maximum  likelihood  decoding 
procedure,  it  is  limited  to  codes  of  relatively  short  con- 
straint length  by  the  storage  requirements.   In  short,  there 
are  no  known  classes  of  convolutional  codes  with  decoding 
procedures  comparable  with  the  longer  Bose-Chaudhuri- 
Hocquenghem  block  codes. 
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